1. Introduction



For the author, one of the main motivations for developing the theory of 
stochastic partial differential equations (SPDEs) is its relation to the filtering 
problem for partially observable diffusion processes. 
This problem's setting is as follows.

Let $(\Omega,\mathcal{F},P)$ be a complete probability space with an increasing filtration
$\{\mathcal{F}_t, t\ge 0\}$ of complete, with respect to $(\mathcal{F},P)$, $\sigma$-fields
$\mathcal{F}_t\subset\mathcal{F}$. Denote by $\mathcal{P}$ the predictable $\sigma$-field in
$\Omega\times(0,\infty)$ associated with $\{\mathcal{F}_t\}$. Let $d\ge 1$, $d_1>d$, and
$d_2\ge d_1$ be integers and $w_t$ be a $d_2$-dimensional Wiener process with respect to
$\{\mathcal{F}_t\}$. Let $K,T,\delta>0$ be fixed finite constants.

Consider a $d_1$-dimensional two component process $z_t=(x_t,y_t)$ with $x_t$
being $d$-dimensional and $y_t$ $(d_1-d)$-dimensional. We assume that $z_t$ is a
diffusion process defined as a solution of the system

$$
\begin{aligned}
dx_t &= b(t,z_t)\,dt+\theta(t,z_t)\,dw_t,\\
dy_t &= B(t,z_t)\,dt+\Theta(t,y_t)\,dw_t
\end{aligned}
\tag{1.1}
$$

with some initial data.

The coefficients of (1.1) are assumed to be vector- or matrix-valued functions
of appropriate dimensions defined on $[0,T]\times\mathbb{R}^{d_1}$. Actually $\Theta(t,y)$ is
assumed to be independent of $x$, so that it is a function on $[0,T]\times\mathbb{R}^{d_1-d}$
rather than $[0,T]\times\mathbb{R}^{d_1}$ but as always we may think of $\Theta(t,y)$ as a function
of $(t,z)$ as well.

The component $x_t$ is treated as unobservable and $y_t$ as the only observations
available. The problem is to find a way to compute the density $\pi_t(x)$ of
the conditional distribution of $x_t$ given $y_s$, $s\le t$. Finding an equation
satisfied by $\pi_t$ (filtering equation) is considered to be a solution of the (filtering)
problem. Filtering equations turn out to be particular cases of SPDEs.

The history of filtering equations for diffusion processes is long and its
beginning is controversial. Probably, the first filtering equations were
published in [St60]. They turned out to be plain wrong. Then in [Ku64] other
equations were proposed, see for instance equation (5) of [Ku64]. However,
it is hard to make sense of these equations because most likely some terms
appeared from stochastic integrals written in the Stratonovich form and the
others appeared from the Itô integrals. Perhaps, the author of [Ku64]
realized this too and published an attempt to rescue some results of [Ku64] in
[Ku67]. This attempt turned out to be successful for simplified models without the
so-called cross terms.

Meanwhile, in [Sh66] the correct filtering equations in full generality, yet
assuming some regularity of the filtering density, were presented and then
in [LS68] they were rigorously proved. This is the reason we propose to call
the filtering equations in the case of partially observable diffusion processes
Shiryaev's equations and their particular case without cross terms Kushner's
equations.

In case $d=1$ the result of [Sh66] is presented in [LS01] on the basis
of the famous Fujisaki-Kallianpur-Kunita theorem (see [FKK]) about the
filtering equations in a very general setting (much more general than in
[LS68]). Some authors even call the filtering equation for diffusion processes
the Fujisaki-Kallianpur-Kunita equation.

By adding to the Fujisaki-Kallianpur-Kunita theorem some simple facts
from the theory of SPDEs, the a priori regularity assumption was removed
in [KR78] and under the Lipschitz and uniform nondegeneracy assumption
the $L_2$-version of Theorem 2.6 was proved. The basic result of [KR78] is
that $\pi_t\in H^1_2$. It is also proved that if the coefficients are smoother, $\pi_t(x)$
is smoother too. The nondegeneracy assumption is removed in [R90] on the
account of assuming that $\theta\theta^*$ is three times continuously differentiable in $x$.
It is again proved that $\pi_t\in H^1_2$ and $\pi_t$ is even smoother if the coefficients
are smoother.

In [K99] the results of [KR78] were improved, $\theta\theta^*$ is assumed to be twice
continuously differentiable in $x$ and it is shown that $\pi_t\in H^1_p$ with any $p\ge 2$.

The above mentioned results of [KR78], [R90], and [K99] use the filtering
theory in combination with the theory of SPDEs, the latter being stimulated
by certain needs of filtering theory. It turns out that the theory of SPDEs
alone can be used to obtain the above mentioned regularity results about
$\pi_t$ without knowing anything from the filtering theory itself. It also can be
used to solve other problems from the filtering theory.

The first "direct" (only using the theory of SPDEs) proof of regularity of
$\pi_t$ is given in [KZ00] in the case that system (1.1) defines a nondegenerate
diffusion process and $\theta\theta^*$ is twice continuously differentiable in $x$. It is
proved that $\pi_t\in H^1_p$ with any $p\ge 2$ as in [K99]. Advantages of having
arbitrary $p$ are seen from results like our Theorem 2.7. Of course, on the
way of investigating $\pi_t$ in [KZ00] filtering equations are derived "directly"
in an absolutely different manner than before (on the basis of an idea from
[KR81]).

In this article we relax the smoothness assumption in [KZ00] to the
assumption that the coefficients of (1.1) are merely Lipschitz continuous, the
assumption which is almost always supposed to hold when one deals with
systems like (1.1). We find that $\pi_t\in H^1_p$. Thus, under the weakest
smoothness assumptions we obtain the best (in the author's opinion) regularity
result on $\pi_t$. In particular, we prove that if the initial data is sufficiently
regular, then the filtering density is almost Lipschitz continuous in $x$ and
almost 1/2 Hölder continuous in $t$. However, we still assume $z_t$ to be
nondegenerate. Our approach is heavily based on analytic results. There is also a
probabilistic approach developed in [Kn97] and based on explicit formulas
for solutions initiated in [Pa79] and later developed in [KR81] and [Kn82]
(also see references therein). This approach cannot give as sharp results as
ours in our situation.

It seems to the author that under the same assumptions of Lipschitz
continuity, by following an idea from [K79] one can solve another problem from
filtering theory, the so-called innovation problem, and obtain the equality

$$
\sigma\{y_s, s\le t\}=\sigma\{\bar w_s, s\le t\},
$$

where $\bar w_t$ is the innovation Wiener process of the problem (its definition is
recalled in Section 2). Recall that for degenerate diffusion processes the
positive solution of the innovation problem is obtained in [Pu84] again on
the basis of the theory of SPDEs under the assumption that the coefficients
are more regular.

By the way, in our situation, if the coefficients are more regular, the
filtering equation can be rewritten in a nondivergence form and then additional
smoothness of the filtering density, existence of which is already established
in this article, is obtained on the basis of regularity results from [K99].

The article is organized as follows. In Section 2 we state our main results,
part of which is proved in the same section. In Sections 3 and 4 we prove
Theorems 2.6 and 2.8, respectively. Section 5 contains a collection of results
from the theory of SPDEs which we use in the previous sections.

As it is done traditionally in filtering theory we consider finite-dimensional
driving Wiener processes. However, our results will be based on the theory
of SPDEs, outlined in Section 5, with countably many Wiener processes.
We leave to the reader to do some trivial modifications in Section 5 in order
to be able to apply its results in such cases.

2. Main results 
First we state and discuss our assumptions.

Assumption 2.1. The functions $b$, $\theta$, $B$, and $\Theta$ are Borel measurable and
bounded functions of their arguments. Each of them satisfies the Lipschitz
condition in $z$ with constant $K\in(0,\infty)$.

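Introduce

$$
\tilde\theta(t,z)=\begin{pmatrix}\theta(t,z)\\ \Theta(t,y)\end{pmatrix},\quad
\tilde b(t,z)=\begin{pmatrix}b(t,z)\\ B(t,z)\end{pmatrix},\quad
\tilde a(t,z)=\frac12\,\tilde\theta\tilde\theta^*(t,z),
$$

$$
\tilde L(t,z)=\tilde a^{ij}(t,z)\frac{\partial^2}{\partial z^i\partial z^j}
+\tilde b^i(t,z)\frac{\partial}{\partial z^i},
\tag{2.2}
$$

where $\theta^*$ is the transpose of $\theta$ and the summation convention is imposed.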

Remark 2.1. System of equations (1.1) can be now written as

$$
dz_t=\tilde b(t,z_t)\,dt+\tilde\theta(t,z_t)\,dw_t. \tag{2.3}
$$

Assumption 2.2. The process $z_t$ is uniformly nondegenerate: for any $\lambda,z\in\mathbb{R}^{d_1}$
and $t\in[0,T]$ we have

$$
\tilde a^{ij}(t,z)\lambda^i\lambda^j\ge\delta|\lambda|^2.
$$

Traditionally, Assumption 2.2 is split into the two following assumptions in
which some useful objects are introduced. These assumptions were also used
in the past to reduce $\tilde\theta$ to the so-called triangular form by replacing $w_t$ with
a different Brownian motion.

Assumption 2.3. The symmetric matrix $\Theta\Theta^*$ is invertible and

$$
\Psi:=(\Theta\Theta^*)^{-1/2}
$$

is a bounded function of $(t,y)$.

Remark 2.2. Assumption 2.3 follows from Assumption 2.2 and, furthermore,

Assumption 2.4. For any $\xi\in\mathbb{R}^d$, $z=(x,y)\in\mathbb{R}^{d_1}$, and $t\ge 0$, we have

$$
|Q(t,y)\theta^*(t,z)\xi|^2\ge\delta|\xi|^2,
$$

where $Q$ is the orthogonal projector on $\operatorname{Ker}\Theta$. In other words,

$$
\big(\theta(I-\Theta^*\Psi^2\Theta)\theta^*\xi,\xi\big)\ge\delta|\xi|^2. \tag{2.4}
$$
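
The equivalence of the two forms of Assumption 2.4 can be checked directly. The following is a short verification sketch, assuming (as in Assumption 2.3) that $\Theta\Theta^*$ is invertible and $\Psi=(\Theta\Theta^*)^{-1/2}$, so that $\Psi^2=(\Theta\Theta^*)^{-1}$; the arguments $(t,y)$ and $(t,z)$ are suppressed.

```latex
% Claim: Q := I - \Theta^*\Psi^2\Theta is the orthogonal projector on Ker \Theta.
\begin{align*}
Q^* &= Q, \\
\Theta Q &= \Theta-(\Theta\Theta^*)\Psi^2\Theta=\Theta-\Theta=0, \\
Q^2 &= Q-\Theta^*\Psi^2(\Theta Q)=Q, \\
\Theta v=0 &\implies Qv=v .
\end{align*}
% Hence Q is symmetric and idempotent with range Ker \Theta, and for \xi in R^d
\[
|Q\theta^*\xi|^2=(Q\theta^*\xi,Q\theta^*\xi)=(Q\theta^*\xi,\theta^*\xi)
=\big(\theta(I-\Theta^*\Psi^2\Theta)\theta^*\xi,\xi\big).
\]
```

Expanding the last expression also gives $|\theta^*\xi|^2-|\Psi\Theta\theta^*\xi|^2$, which is the quantity that appears in Remarks 2.3 and 2.4.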

Remark 2.3. From (2.4) we see that $\theta\theta^*$ is uniformly positive definite with
constant of positivity $\delta$. Also, it turns out that (2.4) holds under
Assumption 2.2.

Indeed, take a $\zeta=(\xi,\Psi\eta)\in\mathbb{R}^{d_1}$ with $\eta=-\Psi\Theta\theta^*\xi$ and observe
that

$$
\begin{aligned}
2\delta|\xi|^2\le 2(\tilde a\zeta,\zeta)=|\tilde\theta^*\zeta|^2
&=|\theta^*\xi|^2+2(\theta^*\xi,\Theta^*\Psi\eta)+|\Theta^*\Psi\eta|^2\\
&=|\theta^*\xi|^2+2(\Psi\Theta\theta^*\xi,\eta)+|\eta|^2
=|\theta^*\xi|^2-|\Psi\Theta\theta^*\xi|^2,
\end{aligned}
$$

which is even stronger than (2.4).

Remark 2.4. We have seen that Assumptions 2.4 and 2.3 follow from
Assumption 2.2. In turn Assumptions 2.4 and 2.3 in combination with
Assumption 2.1 imply Assumption 2.2, perhaps with a different constant in the
latter.

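To show this, we take $\zeta=(\xi,\eta)\in\mathbb{R}^d\times\mathbb{R}^{d_1-d}$ and observe that

$$
\begin{aligned}
2(\tilde a\zeta,\zeta)&=(\theta\theta^*\xi,\xi)+2(\Theta\theta^*\xi,\eta)+(\Theta\Theta^*\eta,\eta)\\
&=|\theta^*\xi|^2+2(\Psi\Theta\theta^*\xi,\bar\eta)+\varepsilon(\Theta\Theta^*\eta,\eta)
+(1-\varepsilon)(\Theta\Theta^*\eta,\eta),
\end{aligned}
$$

where $\bar\eta=\Psi^{-1}\eta$, and $\varepsilon\in(0,1)$. Note that $\varepsilon(\Theta\Theta^*\eta,\eta)=\varepsilon|\bar\eta|^2$.
By using the inequality $2(\mu,\nu)+\varepsilon|\mu|^2\ge-\varepsilon^{-1}|\nu|^2$ we see that

$$
2(\tilde a\zeta,\zeta)\ge|\theta^*\xi|^2-\varepsilon^{-1}|\Psi\Theta\theta^*\xi|^2
+(1-\varepsilon)(\Theta\Theta^*\eta,\eta),
$$

and by taking $N$ such that $\Psi\le N(\delta^{ij})$, for which $\Theta\Theta^*\ge N^{-2}(\delta^{ij})$, we
conclude

$$
\begin{aligned}
2(\tilde a\zeta,\zeta)&\ge|\theta^*\xi|^2-\varepsilon^{-1}|\Psi\Theta\theta^*\xi|^2+(1-\varepsilon)N^{-2}|\eta|^2\\
&\ge\delta|\xi|^2+(1-\varepsilon^{-1})|\Psi\Theta\theta^*\xi|^2+(1-\varepsilon)N^{-2}|\eta|^2,
\end{aligned}
$$

where the last inequality follows from (2.4). Finally, $\Psi\Theta\theta^*$ is a bounded
function, so that, for a constant $N_0$,

$$
2(\tilde a\zeta,\zeta)\ge\big(\delta+N_0(1-\varepsilon^{-1})\big)|\xi|^2+(1-\varepsilon)N^{-2}|\eta|^2.
$$

For $\varepsilon$ sufficiently close to 1 the last expression is greater than $\delta_1|\zeta|^2$ with a
constant $\delta_1>0$, which is equivalent to the uniform ellipticity of $\tilde a$.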
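
To make the last step of Remark 2.4 concrete, one admissible choice of $\varepsilon$ is sketched below. The specific value is only an illustration (any $\varepsilon$ close enough to 1 works), and we assume $N_0>0$, which can always be arranged by enlarging $N_0$; here $N_0$ bounds $|\Psi\Theta\theta^*\xi|^2$ by $N_0|\xi|^2$ and $N$ is the constant with $\Theta\Theta^*\ge N^{-2}(\delta^{ij})$.

```latex
\[
\varepsilon=\frac{2N_0}{2N_0+\delta}\in(0,1)
\quad\Longrightarrow\quad
N_0(1-\varepsilon^{-1})=-\frac{\delta}{2},\qquad
1-\varepsilon=\frac{\delta}{2N_0+\delta}.
\]
% Substituting into the last lower bound of Remark 2.4, with |\zeta|^2 = |\xi|^2 + |\eta|^2:
\[
2(\tilde a\zeta,\zeta)\ge\frac{\delta}{2}|\xi|^2+\frac{\delta}{(2N_0+\delta)N^2}|\eta|^2
\ge\delta_1|\zeta|^2,\qquad
\delta_1:=\min\Big(\frac{\delta}{2},\frac{\delta}{(2N_0+\delta)N^2}\Big)>0 .
\]
```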

Before stating the next assumption we remind the reader that, for $\gamma\in\mathbb{R}$
and $u\in C_0^\infty(\mathbb{R}^d)$ one introduces $(1-\Delta)^{\gamma/2}u$ by means of the Fourier
transform. Then, for $p\in(1,\infty)$, one defines the spaces of Bessel potential
$H^\gamma_p(\mathbb{R}^d)$ as the set of distributions obtained as the closure of $C_0^\infty(\mathbb{R}^d)$ with
respect to the norm

$$
\|u\|_{H^\gamma_p(\mathbb{R}^d)}:=\|(1-\Delta)^{\gamma/2}u\|_{L_p(\mathbb{R}^d)}.
$$

One important and highly nontrivial piece of information is that

$$
H^1_p(\mathbb{R}^d)=W^1_p(\mathbb{R}^d):=\{u\in L_p(\mathbb{R}^d):\nabla u\in L_p(\mathbb{R}^d)\}
$$

and

$$
\|u\|_{H^1_p(\mathbb{R}^d)}\sim\|u\|_{W^1_p(\mathbb{R}^d)}:=\|u\|_{L_p(\mathbb{R}^d)}+\|\nabla u\|_{L_p(\mathbb{R}^d)}. \tag{2.5}
$$
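
For orientation, the operator $(1-\Delta)^{\gamma/2}$ can be written out explicitly. The sketch below uses the unitary normalization of the Fourier transform $F$, which is an assumption of this illustration (the article does not fix a normalization), and records why the norm equivalence is elementary for $p=2$.

```latex
\[
(1-\Delta)^{\gamma/2}u=F^{-1}\big[(1+|\xi|^2)^{\gamma/2}\,Fu\big],
\qquad u\in C_0^\infty(\mathbb{R}^d).
\]
% For p = 2 and \gamma = 1 Plancherel's identity gives
\[
\|u\|_{H^1_2}^2=\int_{\mathbb{R}^d}(1+|\xi|^2)\,|Fu(\xi)|^2\,d\xi
=\|u\|_{L_2}^2+\|\nabla u\|_{L_2}^2 ,
\]
% and therefore
\[
\|u\|_{H^1_2}\le\|u\|_{L_2}+\|\nabla u\|_{L_2}\le\sqrt2\,\|u\|_{H^1_2}.
\]
```

For $p\ne 2$ no such identity is available, which is why the general statement is described as highly nontrivial.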

Assumption 2.5. The random vectors $x_0$ and $y_0$ are independent of the
process $w_t$. The conditional distribution of $x_0$ given $y_0$ has a density, which
we denote by $\pi_0(x)=\pi_0(\omega,x)$. We have $p\ge 2$ and $\pi_0\in L_p(\Omega,H_p^{1-2/p}(\mathbb{R}^d))$
(actually, we need slightly less, see Remark 3.1).

Next we introduce some more notation. Let

$$
\Psi_t=\Psi(t,y_t),\quad\Theta_t=\Theta(t,y_t),\quad
a_t(x)=\frac12\theta\theta^*(t,x,y_t),\quad b_t(x)=b(t,x,y_t),
$$

$$
\sigma_t(x)=\theta(t,x,y_t)\Theta_t^*\Psi_t,\quad\beta_t(x)=\Psi_tB(t,x,y_t).
$$

In the remainder of the article we use the notation

$$
D_i=\frac{\partial}{\partial x^i}
$$

only for $i=1,\dots,d$ and set

$$
L_t(x)=a_t^{ij}(x)D_iD_j+b_t^i(x)D_i, \tag{2.6}
$$

$$
\begin{aligned}
L_t^*(x)u_t(x)&=D_iD_j\big(a_t^{ij}(x)u_t(x)\big)-D_i\big(b_t^i(x)u_t(x)\big)\\
&=D_j\big(a_t^{ij}(x)D_iu_t(x)-b_t^j(x)u_t(x)+u_t(x)D_ia_t^{ij}(x)\big),
\end{aligned}
\tag{2.7}
$$

$$
\Lambda_t^k(x)u_t(x)=\beta_t^k(x)u_t(x)+\sigma_t^{ik}(x)D_iu_t(x), \tag{2.8}
$$

$$
\begin{aligned}
\Lambda_t^{k*}(x)u_t(x)&=\beta_t^k(x)u_t(x)-D_i\big(\sigma_t^{ik}(x)u_t(x)\big)\\
&=-\sigma_t^{ik}(x)D_iu_t(x)+\big(\beta_t^k(x)-D_i\sigma_t^{ik}(x)\big)u_t(x),
\end{aligned}
\tag{2.9}
$$

where $t\in[0,T]$, $x\in\mathbb{R}^d$, $k=1,\dots,d_1-d$, and as above we use the
summation convention over all "reasonable" values of repeated indices, so that the
summation in (2.6), (2.7), (2.8), and (2.9) is done for $i,j=1,\dots,d$ (whereas
in (2.2) for $i,j=1,\dots,d_1$). Observe that Lipschitz continuous functions have
bounded generalized derivatives and by

$$
D_ia_t^{ij}(x),\quad D_i\sigma_t^{ik}(x)
$$

we mean these derivatives. From Remark 2.3 we have that the operator $L$
defined by (2.6) is uniformly elliptic with constant of ellipticity $\delta$.
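
The second form of the formal adjoint of $L_t$ and the form of the adjoint of the first-order operator $\Lambda_t^k$ can be checked by the product rule and integration by parts. A sketch, assuming that $u$ and $v$ are smooth with compact support and that $D_ia_t^{ij}$, $D_i\sigma_t^{ik}$ are the bounded generalized derivatives of the Lipschitz coefficients (the argument $x$ is suppressed):

```latex
\begin{align*}
D_iD_j(a_t^{ij}u)-D_i(b_t^iu)
 &=D_j\big(a_t^{ij}D_iu+uD_ia_t^{ij}\big)-D_j(b_t^ju)\\
 &=D_j\big(a_t^{ij}D_iu-b_t^ju+uD_ia_t^{ij}\big),\\[4pt]
\int_{\mathbb{R}^d}(L_tv)\,u\,dx
 &=\int_{\mathbb{R}^d}\big(a_t^{ij}D_iD_jv+b_t^iD_iv\big)u\,dx\\
 &=-\int_{\mathbb{R}^d}\big(a_t^{ij}D_iu-b_t^ju+uD_ia_t^{ij}\big)D_jv\,dx ,\\[4pt]
\int_{\mathbb{R}^d}(\Lambda_t^kv)\,u\,dx
 &=\int_{\mathbb{R}^d}\big(\beta_t^kv+\sigma_t^{ik}D_iv\big)u\,dx\\
 &=-\int_{\mathbb{R}^d}\big(\sigma_t^{ik}D_iu+(D_i\sigma_t^{ik}-\beta_t^k)u\big)v\,dx .
\end{align*}
```

The right-hand sides involve only first derivatives of $u$, which is what allows the filtering equation to be understood in the weak sense for solutions with one generalized derivative in $x$.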

Finally, by $\mathcal{F}_t^y$ we denote the completion of $\sigma\{y_s:s\le t\}$ with respect to
$P,\mathcal{F}$.

Let us consider the following initial value problem

$$
d\bar\pi_t(x)=L_t^*(x)\bar\pi_t(x)\,dt+\Lambda_t^{k*}(x)\bar\pi_t(x)\Psi_t^{kr}\,dy_t^r, \tag{2.10}
$$

$$
\bar\pi_0(x)=\pi_0(x),
$$

where $t\in[0,T]$, $x\in\mathbb{R}^d$, and $\bar\pi_t(x)=\bar\pi_t(\omega,x)$. Equation (2.10) is called the
Duncan-Mortensen-Zakai or just the Zakai equation.

We understand this equation and the initial condition in the following
sense. We are looking for a function $\bar\pi=\bar\pi_t(x)=\bar\pi_t(\omega,x)$, $\omega\in\Omega$, $t\in[0,T]$,
$x\in\mathbb{R}^d$, such that

(i) For each $(\omega,t)$, $\bar\pi_t(\omega,x)$ is a generalized function on $\mathbb{R}^d$,

(ii) We have $\bar\pi\in L_p(\Omega\times[0,T],\mathcal{P},H^1_p(\mathbb{R}^d))$,

(iii) For each $\varphi\in C_0^\infty(\mathbb{R}^d)$ with probability one for all $t\in[0,T]$ it holds
that

$$
\begin{aligned}
(\bar\pi_t,\varphi)=(\pi_0,\varphi)
&-\int_0^t\big(a_s^{ij}D_i\bar\pi_s-b_s^j\bar\pi_s+\bar\pi_sD_ia_s^{ij},D_j\varphi\big)\,ds\\
&-\int_0^t\big(\sigma_s^{ik}D_i\bar\pi_s+(D_i\sigma_s^{ik}-\beta_s^k)\bar\pi_s,\varphi\big)\Psi_s^{kr}
\big(B^r(s,z_s)\,ds+\Theta^{rn}(s,y_s)\,dw_s^n\big),
\end{aligned}
\tag{2.11}
$$

where by $(f,\varphi)$ we mean the action of a generalized function $f$ on $\varphi$, in
particular, if $f$ is locally summable,

$$
(f,\varphi)=\int_{\mathbb{R}^d}f(x)\varphi(x)\,dx.
$$

Observe that all expressions in (2.11) are well defined due to the fact that
the coefficients of $\bar\pi$ and of $D_i\bar\pi$ are bounded and appropriately measurable
and $\bar\pi,D\bar\pi\in L_p(\Omega\times[0,T],\mathcal{P},L_p(\mathbb{R}^d))$ (see (2.5)).

Hence, equation (2.10) has the same form as (5.1) and the existence and
uniqueness part of Lemma 2.5 below follow from Theorem 5.1 and Remark
3.1. The second assertion of the lemma follows from Theorem 5.4.

In all that follows in the main part of the article we suppose that
Assumptions 2.1, 2.2, and 2.5 are satisfied.

Lemma 2.5. There exists a unique solution $\bar\pi$ of (2.10) with initial
condition $\pi_0$ in the sense explained above. In addition, $\bar\pi_t\ge 0$ for all $t\in[0,T]$
(a.s.).

Here is a basic result of filtering theory for partially observable diffusion
processes. Its relation to the previously known ones is discussed above.

Theorem 2.6. Let $\bar\pi$ be the function from Lemma 2.5. Then

$$
0<\int_{\mathbb{R}^d}\bar\pi_t(x)\,dx=(\bar\pi_t,1)<\infty \tag{2.12}
$$

for all $t\in[0,T]$ (a.s.) and for any $t\in[0,T]$ and real-valued, bounded or
nonnegative, (Borel) measurable function $f$ given on $\mathbb{R}^d$

$$
E[f(x_t)\mid\mathcal{F}_t^y]=\frac{(\bar\pi_t,f)}{(\bar\pi_t,1)}\quad\text{(a.s.)}. \tag{2.13}
$$

Equation (2.13) shows (by definition) that

$$
\pi_t(x):=\frac{\bar\pi_t(x)}{(\bar\pi_t,1)}
$$

is a conditional density of distribution of $x_t$ given $y_s$, $s\le t$. Since, generally,
$(\bar\pi_t,1)\ne 1$, one calls $\bar\pi_t$ an unnormalized conditional density of distribution
of $x_t$ given $y_s$, $s\le t$.

The following is a direct corollary of Theorem 5.5.

Theorem 2.7. Let $\pi_0$ be a nonrandom function and $\pi_0\in H_p^{1-2/p}(\mathbb{R}^d)$ for all
$p\ge 2$, which happens, for instance, if $\pi_0$ is a Lipschitz continuous function
with compact support. Then for any $\varepsilon\in(0,1/2)$ almost surely $\bar\pi_t(x)$ is
$1/2-\varepsilon$ Hölder continuous in $t$ with a constant independent of $x$, $\bar\pi_t(x)$ is
$1-\varepsilon$ Hölder continuous in $x$ with a constant independent of $t$, and the above
mentioned (random) constants have all moments.

In filtering theory usually the following theorem is proved before anything
else is done. We do not need it for proving the above results and give the
proof just to show that the $L_p$-theory of SPDEs allows one to get all basic
results from filtering theory.

Historically, $P_t[\beta]$ was introduced by (2.16) and shown to have (a
modification possessing) appropriate measurability properties. Then $\bar\pi_t$ used to
be defined as the density of conditional distribution of $x_t$ given $\mathcal{F}_t^y$ divided
by an appropriate modification of

$$
E\{\rho_t\mid\mathcal{F}_t^y\}, \tag{2.14}
$$

where

$$
\rho_t=\exp\Big(-\int_0^t\beta_s\,d\tilde w_s-\frac12\int_0^t|\beta_s|^2\,ds\Big),\quad
\beta_s=\beta_s(x_s),\quad\tilde w_t=\int_0^t\Psi_s\Theta_s\,dw_s.
$$

In this case $(\bar\pi_t,1)^{-1}$ turns out to be this same appropriate modification of
(2.14) (cf. our (3.20)).

The most surprising statements in Theorem 2.8 are assertions (iv) and
(v). In (iv) the difference of two Wiener processes $\bar w_t$ and $\tilde w_t$ (that the latter
is a Wiener process is checked in the proof of Lemma 3.3) is asserted to be
a differentiable nontrivial function.

Assertion (v) shows that (2.14), which is a conditional expectation of
a martingale, is again a martingale and, moreover, while evaluating it we
can just put conditional expectations of $\beta_s$ given $\mathcal{F}_s^y$ in place of $\beta_s$ in the
expression of $\rho_t$ with simultaneous replacement of $\tilde w$ with $\bar w$.

Theorem 2.8. (i) The process $(\bar\pi_t,1)$ is continuous in $t$ (a.s.) and (a.s.)
for all $t\in[0,T]$

$$
(\bar\pi_t,1)=(\pi_0,1)+\int_0^t(\bar\pi_s,\beta_s^k)\Psi_s^{kr}B^r(s,z_s)\,ds
+\int_0^t(\bar\pi_s,\beta_s^k)\Psi_s^{kr}\Theta^{rn}(s,y_s)\,dw_s^n. \tag{2.15}
$$

(ii) The process $\bar\pi_t$ is a continuous $L_1$-valued process (a.s.).

(iii) Introduce $P_t[\beta]=(P_t[\beta^1],\dots,P_t[\beta^{d_1-d}])$ by

$$
P_t[\beta]=(\bar\pi_t,1)^{-1}\int_{\mathbb{R}^d}\beta_t(x)\bar\pi_t(x)\,dx
=(\bar\pi_t,1)^{-1}\Psi(t,y_t)\int_{\mathbb{R}^d}B(t,x,y_t)\bar\pi_t(x)\,dx.
$$

Then $P_t[\beta]$ is a jointly measurable bounded $\mathcal{F}_t^y$-adapted process on $[0,T]$ (a.s.)
and for each $t\in[0,T]$

$$
P_t[\beta]=E\big(\beta_t(x_t)\mid\mathcal{F}_t^y\big)\quad\text{(a.s.)}. \tag{2.16}
$$

(iv) The process

$$
\bar w_t=\tilde w_t+\int_0^t\big(\beta_s(x_s)-P_s[\beta]\big)\,ds
$$

is a $(d_1-d)$-dimensional Wiener process with respect to $\mathcal{F}_t^y$ (the so-called
innovation process), where

$$
\tilde w_t=\int_0^t\Psi_s\Theta_s\,dw_s.
$$

(v) We have (a.s.) for all $t\in[0,T]$

$$
(\bar\pi_t,1)=\exp\Big(\int_0^tP_s[\beta]\,d\bar w_s+\frac12\int_0^t|P_s[\beta]|^2\,ds\Big), \tag{2.17}
$$

so that

$$
(\bar\pi_t,1)^{-1}=\exp\Big(-\int_0^tP_s[\beta]\,d\bar w_s-\frac12\int_0^t|P_s[\beta]|^2\,ds\Big)
$$

is an exponential martingale, and for each $m>0$

$$
E\sup_{t\le T}(\bar\pi_t,1)^m+E\sup_{t\le T}(\bar\pi_t,1)^{-m}<\infty. \tag{2.18}
$$
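
A formal computation shows how the exponential formula in assertion (v) is related to the integral identity in assertion (i). This is only a heuristic sketch, not the proof given in the article: it uses the auxiliary notation $\check w_t=\int_0^t\Psi_s\,dy_s$ (introduced here for the illustration), writes $\tilde w_t=\int_0^t\Psi_s\Theta_s\,dw_s$ and $\bar w_t$ for the innovation process, and takes for granted that $(\bar\pi_t,1)>0$, that $(\pi_0,1)=1$, and that Itô's formula applies.

```latex
% Assertion (i) in differential form, using dy^r = B^r dt + \Theta^{rn} dw^n
% and (\bar\pi_t,\beta^k_t) = (\bar\pi_t,1) P_t[\beta^k]:
\[
d(\bar\pi_t,1)=(\bar\pi_t,\beta_t^k)\,\Psi_t^{kr}\,dy_t^r
=(\bar\pi_t,1)\,P_t[\beta^k]\,d\check w_t^k .
\]
% Ito's formula for the logarithm, with d<\check w^k,\check w^l>_t = \delta^{kl} dt:
\[
d\log(\bar\pi_t,1)=P_t[\beta^k]\,d\check w_t^k-\tfrac12|P_t[\beta]|^2\,dt .
\]
% By (iv), d\check w_t = \beta_t(x_t) dt + d\tilde w_t = d\bar w_t + P_t[\beta] dt, hence
\[
d\log(\bar\pi_t,1)=P_t[\beta^k]\,d\bar w_t^k+\tfrac12|P_t[\beta]|^2\,dt ,
\]
% which integrates to the exponential formula of assertion (v).
```

The sign of the quadratic term changes from minus to plus precisely because the drift $P_t[\beta]\,dt$ is absorbed when passing from $\check w$ to the innovation process $\bar w$.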

3. Proof of Theorem 2.6

We will use some notions and results from the theory of SPDEs, which are
recalled in Section 5. From now on we drop $\mathbb{R}^d$ in notation like $H^\gamma_p(\mathbb{R}^d)$ and
$L_p(\mathbb{R}^d)$.

Remark 3.1. The assumption that $\pi_0\in L_p(\Omega,H_p^{1-2/p})$ is only needed to
guarantee (see the proof of Theorem 5.1 of [K99]) that there exists a
$\psi\in\mathcal{H}^1_p(T)$ such that $\psi_0=\pi_0$,

$$
d\psi_t=\Delta\psi_t\,dt=D_if_t^i\,dt,\quad(f_t^i=D_i\psi_t),
$$

$$
\|\psi\|^p_{\mathcal{H}^1_p(T)}\le NE\|\pi_0\|^p_{H_p^{1-2/p}}
$$

with $N$ independent of $\pi_0$.

As is mentioned before Lemma 2.5, by Theorem 5.1 and Remark 3.1
there exists a unique solution $\bar\pi\in\mathcal{H}^1_p(T)$ of (2.10) with initial condition $\pi_0$.
By Theorem 5.4, $\bar\pi_t\ge 0$ for all $t\in[0,T]$ (a.s.). By Theorem 5.5, $\bar\pi_t$ is a
continuous $L_p$-valued process and

$$
E\sup_{t\in[0,T]}\|\bar\pi_t\|^p_{L_p}<\infty. \tag{3.1}
$$

Now, we prove three auxiliary results.

Lemma 3.2. Let $\xi_t,\xi_t^n$, $n=1,2,\dots$, $t\in[0,T]$, be $k$-dimensional continuous
semimartingales such that, for any $t\in[0,T]$, $\xi_t^n\to\xi_t$ in probability as
$n\to\infty$. Assume that

$$
\xi_t^n=\xi_0^n+\int_0^t\alpha_s^n\,ds+m_t^n,\quad\xi_t=\xi_0+\int_0^t\alpha_s\,ds+m_t,
$$

where $\alpha_t$ and $\alpha_t^n$ are predictable processes bounded by the same nonrandom
constant and $m_t$ and $m_t^n$ are martingales such that

$$
\langle m^{ni},m^{nj}\rangle_t=\int_0^t\gamma_s^{nij}\,ds,\quad
\langle m^i,m^j\rangle_t=\int_0^t\gamma_s^{ij}\,ds,\quad i,j=1,\dots,k,
$$

where $\gamma_t^n:=(\gamma_t^{nij})$ and $\gamma_t:=(\gamma_t^{ij})$ are predictable matrix-valued processes
bounded by the same nonrandom constant and such that $(\gamma_t^n)^{-1}$ and $(\gamma_t)^{-1}$
exist and are also bounded by the same nonrandom constant.

Assume that on $[0,T]\times\mathbb{R}^l\times\mathbb{R}^k$ we are given functions $f_t^n(x,y)$ and $f_t(x,y)$
such that they are uniformly bounded and $f^n\to f$ in measure as $n\to\infty$.

Then $f_t^n(x,\xi_t^n)\to f_t(x,\xi_t)$ in measure on $\Omega\times[0,T]\times\mathbb{R}^l$.

Proof. It suffices to show that any subsequence $\{n'\}$ of integers has a
subsequence $\{n''\}$ such that $f_t^{n''}(x,\xi_t^{n''})\to f_t(x,\xi_t)$ in measure. Since any
subsequence $\{n'\}$ has a subsequence $\{n''\}$ such that $f^{n''}\to f$ almost
everywhere, by having in mind renumbering if needed, we may assume that for
the original sequence we have $f^n\to f$ almost everywhere. In that case for
almost any $x\in\mathbb{R}^l$, $f_t^n(x,y)\to f_t(x,y)$ for almost all $(t,y)$ and, if we prove that for each such $x$
we have $f_t^n(x,\xi_t^n)\to f_t(x,\xi_t)$ in measure on $\Omega\times[0,T]$, then

$$
E\int_0^T|f_t^n(x,\xi_t^n)-f_t(x,\xi_t)|\,dt\to 0,
$$

which after being integrated with respect to $x$ would show that $f_t^n(x,\xi_t^n)\to
f_t(x,\xi_t)$ in measure on $\Omega\times[0,T]\times\mathbb{R}^l$.

It follows that we only need to prove that, if on $[0,T]\times\mathbb{R}^k$ we are given
functions $f_t^n(y)$ and $f_t(y)$ such that they are uniformly bounded and $f^n\to f$
$(t,y)$-almost everywhere as $n\to\infty$, then

$$
E\int_0^T|f_t^n(\xi_t^n)-f_t(\xi_t)|\,dt\to 0. \tag{3.2}
$$

Furthermore, since the coefficients $\alpha^n$, $\alpha$, $\gamma^n$, and $\gamma$ are uniformly bounded

$$
\sup_n\sup_{t\in[0,T]}P(|\xi_t^n|+|\xi_t|\ge R)\le R^{-2}\sup_n\sup_{t\in[0,T]}E(|\xi_t^n|^2+|\xi_t|^2)\to 0
$$

as $R\to\infty$. Therefore, if for any $R\in(0,\infty)$ we know that (3.2) is true
provided that $f_t^n(y)$ and $f_t(y)$ vanish for $|y|\ge R$, then by applying this
result in the general case to $f_t^n(y)I_{|y|<R}$ and $f_t(y)I_{|y|<R}$ we would obtain
that

$$
\limsup_{n\to\infty}E\int_0^T|f_t^n(\xi_t^n)-f_t(\xi_t)|\,dt\le NR^{-2},
$$

where $N$ is independent of $R$. This would imply (3.2) in the general case.
This shows that without restricting generality we may assume that for an
$R\in(0,\infty)$ the functions $f_t^n(y)$ and $f_t(y)$ vanish if $|y|\ge R$.

Now observe that the left-hand side of (3.2) is majorated by $I_n+J_n$,
where

$$
I_n=E\int_0^T|f_t^n(\xi_t^n)-f_t(\xi_t^n)|\,dt,\quad
J_n=E\int_0^T|f_t(\xi_t^n)-f_t(\xi_t)|\,dt.
$$

We recall a result of [K77] implying that for any $g\in L_{k+1}([0,T]\times\mathbb{R}^k)$ we
have

$$
E\int_0^T\big(|g_t(\xi_t^n)|+|g_t(\xi_t)|\big)\,dt\le N\|g\|_{L_{k+1}([0,T]\times\mathbb{R}^k)},
$$

where $N$ is independent of $n$ and $g$. We apply this result to $g=f^n-f$ and
observe that these functions are uniformly bounded, vanish for $|y|\ge R$, and
tend to zero in measure. Hence, their $L_{k+1}([0,T]\times\mathbb{R}^k)$-norms tend to zero.
This implies that $I_n\to 0$.

Next, notice that for any function $g$

$$
\begin{aligned}
J_n\le{}&E\int_0^T|g_t(\xi_t^n)-g_t(\xi_t)|\,dt\\
&+E\int_0^T|f_t(\xi_t^n)-g_t(\xi_t^n)|\,dt+E\int_0^T|f_t(\xi_t)-g_t(\xi_t)|\,dt,
\end{aligned}
$$

implying that

$$
\limsup_{n\to\infty}J_n\le\limsup_{n\to\infty}E\int_0^T|g_t(\xi_t^n)-g_t(\xi_t)|\,dt
+N\|f-g\|_{L_{k+1}([0,T]\times\mathbb{R}^k)}, \tag{3.3}
$$

where $N$ is independent of $g$. For any $\varepsilon>0$ we can find a smooth $g$ such
that the second term on the right in (3.3) will be less than $\varepsilon$. In addition,
the first term vanishes for smooth $g$ since $\xi_t^n\to\xi_t$ in probability for any $t$.
Since $\varepsilon$ is arbitrary, it follows that the left-hand side of (3.3) equals zero.
The lemma is proved.

The following result with its proof is an adaptation of Lemma 5.1 of
[KZ00] and its proof.

Lemma 3.3. The function $\bar\pi_t$ is $\mathcal{F}_t^y$-adapted.

Proof. Define

$$
\beta_t=\beta_t(x_t)=\Psi_tB(t,z_t),\quad\check w_t=\int_0^t\Psi_s\,dy_s,\quad
\tilde w_t=\int_0^t\Psi_s\Theta_s\,dw_s.
$$

Since $\Psi_t$ is $\mathcal{F}_t^y$-adapted, the process $\check w_t$ is $\mathcal{F}_t^y$-adapted too. Furthermore,
$\Psi_s\Theta_s\Theta_s^*\Psi_s$ is a unit matrix so that by Lévy's theorem $\tilde w_t$ is a Wiener process.
We want to change the probability measure so that $\check w_t$ would become a
Wiener process with respect to this new measure. Define

$$
\rho_t=\exp\Big(-\int_0^t\beta_s\,d\tilde w_s-\frac12\int_0^t|\beta_s|^2\,ds\Big),\quad
Q(d\omega)=\rho_T(\omega)\,P(d\omega). \tag{3.4}
$$

The process $\rho_t$ is an exponential local martingale. Since $\beta$ is bounded, $\rho_t$ is
square integrable, so that $Q$ is a probability measure. Since

$$
d\check w_t=\beta_t\,dt+d\tilde w_t
$$

and $\tilde w_t$ is a Wiener process on $(\Omega,\mathcal{F},P)$, by Girsanov's theorem, $\check w_t$,
$t\in[0,T]$, is a Wiener process on $(\Omega,\mathcal{F},Q)$ with respect to the filtration $\{\mathcal{F}_t\}$.
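
The two facts used in this part of the proof follow from system (1.1) in one line each. A sketch, with $\check w_t=\int_0^t\Psi_s\,dy_s$ and $\tilde w_t=\int_0^t\Psi_s\Theta_s\,dw_s$ (the accents are labels adopted here to tell the two processes apart), and using that $\Psi_t=(\Theta_t\Theta_t^*)^{-1/2}$ is symmetric:

```latex
\begin{align*}
d\check w_t&=\Psi_t\,dy_t=\Psi_tB(t,z_t)\,dt+\Psi_t\Theta_t\,dw_t
=\beta_t\,dt+d\tilde w_t,\\[4pt]
d\langle\tilde w^k,\tilde w^l\rangle_t
&=(\Psi_t\Theta_t\Theta_t^*\Psi_t)^{kl}\,dt
=\big((\Theta_t\Theta_t^*)^{-1/2}\,\Theta_t\Theta_t^*\,(\Theta_t\Theta_t^*)^{-1/2}\big)^{kl}\,dt
=\delta^{kl}\,dt .
\end{align*}
```

The second line is the hypothesis of Lévy's characterization for the continuous local martingale $\tilde w_t$, and the first line is the drift decomposition to which Girsanov's theorem is applied.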

As has been noticed before, it is $\mathcal{F}_t^y$-adapted and, obviously,

$$
\mathcal{F}_t^y\subset\mathcal{F}_t,
$$

so that $(\check w_t,\mathcal{F}_t^y)$ is a Wiener process. Now rewrite (2.10) as

$$
d\bar\pi_t(x)=L_t^*(x)\bar\pi_t(x)\,dt+\Lambda_t^{k*}(x)\bar\pi_t(x)\,d\check w_t^k, \tag{3.5}
$$

and consider this equation relative to $(\Omega,\mathcal{F},Q)$.

By Theorem 5.1 and Remark 3.1 equation (3.5) with initial data $\pi_0$ has
a unique $\mathcal{F}_t^y$-adapted solution belonging to $\mathcal{H}^1_p(\mathcal{F}_\cdot^y,Q,T)\subset\mathcal{H}^1_p(\mathcal{F}_\cdot,Q,T)$,
where by $\mathcal{H}^1_p(\mathcal{F}_\cdot^y,Q,T)$ we mean the space $\mathcal{H}^1_p(T)$ constructed on the basis
of the new probability measure $Q$ and filtration $\mathcal{F}_\cdot^y$. We denote by $\hat\pi_t$ this
solution.

We have already mentioned that $\bar\pi\in\mathcal{H}^1_p(\mathcal{F}_\cdot,P,T)$. We want to derive
that $\bar\pi_t$ is $\mathcal{F}_t^y$-adapted from the uniqueness by showing that $\bar\pi=\hat\pi$ because
both are $\mathcal{F}_t$-adapted solutions of the same equation. The only obstacle is
that the norms in $\mathcal{H}^1_p(\mathcal{F}_\cdot,Q,T)$ and $\mathcal{H}^1_p(T)$ are different. To overcome this
obstacle, we are going to use stopping times.

For integers $n$ define

$$
\tau(n)=T\wedge\inf\Big\{t\ge 0:\int_0^t\|\hat\pi_s\|^p_{H^1_p}\,ds\ge n\Big\}.
$$

Obviously, $\tau(n)$ are $\mathcal{F}_t^y$-stopping times and $\mathcal{F}_t$-stopping times. Furthermore,

$$
\|\hat\pi\|^p_{\mathcal{H}^1_p(\mathcal{F}_\cdot,P,\tau(n))}=E\int_0^{\tau(n)}\|\hat\pi_s\|^p_{H^1_p}\,ds\le n<\infty.
$$

This and the equation (cf. (3.5))

$$
d\hat\pi_t(x)=\big[L_t^*(x)\hat\pi_t(x)+\beta_t^k\Lambda_t^{k*}(x)\hat\pi_t(x)\big]\,dt
+\Lambda_t^{k*}(x)\hat\pi_t(x)\,d\tilde w_t^k
$$

show that $\hat\pi\in\mathcal{H}^1_p(\mathcal{F}_\cdot,P,\tau(n))$. By the above mentioned uniqueness, $\hat\pi_t=
\bar\pi_t$ on $(0,\tau(n)]$ (a.e.). Since both functions are continuous in $t\in[0,T]$
(Theorem 5.5 (i)), we have that

$$
\hat\pi_tI_{0<t\le\tau(n)}\quad\text{and}\quad\bar\pi_tI_{0<t\le\tau(n)}
$$

are indistinguishable, and since one of them is $\mathcal{F}_t^y$-adapted, so is the other.
We conclude that $\bar\pi_tI_{0<t\le\tau(n)}$ is $\mathcal{F}_t^y$-adapted, which after letting $n\to\infty$
yields the result. The lemma is proved.

The assertion of the following lemma is a very particular case of one of the
assertions of Theorem 2.8. Before stating the lemma we recall that $\bar\pi_t\ge 0$
for all $t\in[0,T]$ (a.s.), so that $(\bar\pi_t,1)$ is well defined (and may be infinite).

Lemma 3.4. We have

$$
E\sup_{t\in[0,T]}(\bar\pi_t,1)^{1/2}<\infty. \tag{3.6}
$$

Proof. For $\varphi\in C_0^\infty(\mathbb{R}^d)$ one can rewrite (2.11) as

$$
(\bar\pi_t,\varphi)=(\pi_0,\varphi)+\int_0^t(\bar\pi_s,L_s\varphi)\,ds
+\int_0^t(\bar\pi_s,\Lambda_s^k\varphi)\Psi_s^{kr}\big(B^r(s,z_s)\,ds+\Theta^{rn}(s,y_s)\,dw_s^n\big). \tag{3.7}
$$

Using (3.1) and an obvious passage to the limit, it is easy to prove that (3.7)
holds not only for $\varphi\in C_0^\infty(\mathbb{R}^d)$, but also for $\varphi\in W^2_q$ with $q=p/(p-1)$.



On $\mathbb{R}^d$ for $m=1,2,\dots$ introduce the functions

$$
\varphi(x)=(1+|x|^2)^{-d},\quad\varphi_m(x)=\varphi(x/m).
$$

Observe that for a constant $N$ it holds that

$$
|D_i\varphi_m|+m|D_iD_j\varphi_m|\le Nm^{-1}\varphi_m \tag{3.8}
$$

on $\mathbb{R}^d$ for all $m$. In particular,

$$
2L_t\varphi_m\le N_0\varphi_m,\quad 2|\Psi_t^{kr}B^r(t,z_t)\Lambda_t^k\varphi_m|\le N_0\varphi_m, \tag{3.9}
$$

where $N_0$ is a constant independent of $m$ and the arguments of the functions
involved.

By plugging in (3.7) the function $\varphi_m$ in place of $\varphi$, we obtain

$$
(\bar\pi_t,\varphi_m)=(\pi_0,\varphi_m)+\int_0^t(\bar\pi_s,L_s\varphi_m)\,ds
+\int_0^t(\bar\pi_s,\Lambda_s^k\varphi_m)\Psi_s^{kr}\big(B^r(s,z_s)\,ds+\Theta^{rn}(s,y_s)\,dw_s^n\big). \tag{3.10}
$$

By using Itô's formula for transforming

$$
(\bar\pi_t,\varphi_m)e^{-N_0t}, \tag{3.11}
$$

and using (3.9) we see that

$$
\begin{aligned}
d\big[(\bar\pi_t,\varphi_m)e^{-N_0t}\big]={}&e^{-N_0t}(\bar\pi_t,\Lambda_t^k\varphi_m)\Psi_t^{kr}\Theta^{rn}(t,y_t)\,dw_t^n\\
&+e^{-N_0t}\big[(\bar\pi_t,L_t\varphi_m)+(\bar\pi_t,\Lambda_t^k\varphi_m)\Psi_t^{kr}B^r(t,z_t)-N_0(\bar\pi_t,\varphi_m)\big]\,dt\\
\le{}&e^{-N_0t}(\bar\pi_t,\Lambda_t^k\varphi_m)\Psi_t^{kr}\Theta^{rn}(t,y_t)\,dw_t^n.
\end{aligned}
$$

It follows that process (3.11) is a supermartingale. It is continuous and
nonnegative. Therefore,

$$
E\sup_{t\in[0,T]}e^{-N_0t}\Big(\int_{\mathbb{R}^d}\varphi_m\bar\pi_t(x)\,dx\Big)^{1/2}
\le 2\Big(E\int_{\mathbb{R}^d}\varphi_m\pi_0(x)\,dx\Big)^{1/2}\le 2.
$$

Upon letting $m\to\infty$ and using the monotone convergence theorem we come
to (3.6) and the lemma is proved.
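
The constant 2 in the final estimate of this proof comes from the maximal inequality for nonnegative supermartingales. A sketch of the computation, for a generic continuous nonnegative supermartingale $X_t$ with $A:=EX_0\in(0,\infty)$ (the symbols $X_t$ and $A$ are notation of this illustration only):

```latex
% Maximal inequality for a nonnegative supermartingale:
\[
P\Big(\sup_{t\le T}X_t\ge c\Big)\le\frac{A}{c}\qquad(c>0).
\]
% Layer-cake formula for the nonnegative random variable \sup_t X_t^{1/2}:
\[
E\sup_{t\le T}X_t^{1/2}
=\int_0^\infty P\Big(\sup_{t\le T}X_t\ge c^2\Big)\,dc
\le\int_0^{\sqrt A}dc+\int_{\sqrt A}^\infty\frac{A}{c^2}\,dc
=2\sqrt A .
\]
```

In the proof this is applied with $X_t=(\bar\pi_t,\varphi_m)e^{-N_0t}$, for which $A=E(\pi_0,\varphi_m)\le 1$ because $0\le\varphi_m\le 1$ and $\pi_0$ is a probability density.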

Proof of Theorem 2.6. Take a nonnegative $\zeta\in C_0^\infty(\mathbb{R}^{d_1})$, which
integrates to one and for $n=1,2,\dots$ set

$$
\zeta_n(z)=n^{d_1}\zeta(nz).
$$

Also introduce mollifications of one of the coefficients of (1.1) by

$$
\theta^{(n)}(t,z)=\zeta_n(z)*\theta(t,z),
$$

where the convolution is taken with respect to $z$.
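
The closeness of the mollified coefficient to $\theta$, which is used further on in the proof, follows from the Lipschitz condition alone. A sketch, assuming the kernel is normalized as $\zeta_n(z)=n^{d_1}\zeta(nz)$ with $\zeta\ge 0$ and $\int\zeta\,dz=1$, and writing $K$ for the Lipschitz constant of $\theta$ in $z$ from Assumption 2.1:

```latex
\begin{align*}
|\theta^{(n)}(t,z)-\theta(t,z)|
 &=\Big|\int_{\mathbb{R}^{d_1}}\zeta_n(u)\big(\theta(t,z-u)-\theta(t,z)\big)\,du\Big|\\
 &\le K\int_{\mathbb{R}^{d_1}}\zeta_n(u)\,|u|\,du
 =\frac{K}{n}\int_{\mathbb{R}^{d_1}}\zeta(v)\,|v|\,dv=:\frac{N}{n},
\end{align*}
% where the substitution v = nu was used; N depends only on K and \zeta.
```

The same representation shows that $\theta^{(n)}$ is again Lipschitz in $z$ with constant $K$, uniformly in $n$.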

The function $\zeta$ can be considered as the density of a random variable. If
needed, we extend our initial probability space in such a way that it would
allow us to introduce a new random $\mathbb{R}^{d_1}$-valued vector $\xi$ having density $\zeta$
and such that $\xi$ is independent of $z_0$ and the process $w_t$, $t\ge 0$.

After that, for $n=1,2,\dots$, we consider the following modification of (1.1):

$$
\begin{aligned}
dx_t^{(n)}&=b(t,z_t^{(n)})\,dt+\theta^{(n)}(t,z_t^{(n)})\,dw_t,\\
dy_t^{(n)}&=B(t,z_t^{(n)})\,dt+\Theta(t,y_t^{(n)})\,dw_t
\end{aligned}
\tag{3.12}
$$

14 N.V. KRYLOV 

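with initial data $x_0^{(n)}=x_0+n^{-1}\xi$, $y_0^{(n)}=y_0$ and $z_t^{(n)}=(x_t^{(n)},y_t^{(n)})$. Observe
that the conditional distribution of $x_0^{(n)}$ given $y_0$ has a density, equal to the
convolution of $\pi_0$ with the density of $n^{-1}\xi$, which we denote by $\pi_0^{(n)}$.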

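Since $\theta(t,x,y)$ is Lipschitz in $x$ (even in $(x,y)$) we have $|\theta(t,z)-\theta^{(n)}(t,z)|\le
Nn^{-1}$, where $N$ is independent of $n,t,z$. This shows that system (3.12)
satisfies Assumption 2.2 for all large $n$. In addition $\theta^{(n)}$ possesses enough
smoothness in order for the results of [KZ00] to be applicable. For all large
$n$, it follows that, for any smooth bounded and nonnegative function $c_t(y)$
on $[0,T]\times\mathbb{R}^{d_1-d}$ and any $\varphi\in C_0^\infty(\mathbb{R}^{d_1})$,

$$
\begin{aligned}
&E\varphi(z_T^{(n)})\exp\Big(-\int_0^Tc_s(y_s^{(n)})\,ds\Big)\\
&\quad=E\rho_T^{(n)}\int_{\mathbb{R}^d}\varphi(x,y_T^{(n)})\bar\pi_T^{(n)}(x)\,dx\,
\exp\Big(-\int_0^Tc_s(y_s^{(n)})\,ds\Big),
\end{aligned}
\tag{3.13}
$$

where $\bar\pi_t^{(n)}$ is the solution of equation (2.10) corresponding to system (3.12)
with initial condition $\bar\pi_0^{(n)}=\pi_0^{(n)}$ and $\rho_t^{(n)}$ is introduced as in (3.4) on the
basis of (3.12):

$$
\rho_t^{(n)}=\exp\Big(-\int_0^t\beta_s^{(n)}\,d\tilde w_s^{(n)}-\frac12\int_0^t|\beta_s^{(n)}|^2\,ds\Big),
$$

$$
\beta_t^{(n)}=\Psi_t^{(n)}B(t,z_t^{(n)}),\quad\tilde w_t^{(n)}=\int_0^t\Psi_s^{(n)}\Theta_s^{(n)}\,dw_s,
$$

$$
\Theta_t^{(n)}=\Theta(t,y_t^{(n)}),\quad\Psi_t^{(n)}=\Psi(t,y_t^{(n)}).
$$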
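Later on we will also use the following notation for other coefficients of
equation (2.10) corresponding to system (3.12). Introduce

$$
a_t^{(n)}(x)=\frac12\theta^{(n)}\theta^{(n)*}(t,x,y_t^{(n)}),\quad b_t^{(n)}(x)=b(t,x,y_t^{(n)}),
$$

$$
\sigma_t^{(n)}(x)=\theta^{(n)}(t,x,y_t^{(n)})\Theta_t^{(n)*}\Psi_t^{(n)}.
$$

Since we know that $\bar\pi^{(n)}\ge 0$, it follows from the validity of (3.13) for all
$\varphi\in C_0^\infty(\mathbb{R}^{d_1})$, that it is also valid for all Borel nonnegative or bounded $\varphi$.
In particular, for any $f\in C_0^\infty(\mathbb{R}^d)$ (independent of $y$) we have

$$
\begin{aligned}
&Ef(x_T^{(n)})\exp\Big(-\int_0^Tc_s(y_s^{(n)})\,ds\Big)\\
&\quad=E\rho_T^{(n)}\int_{\mathbb{R}^d}f(x)\bar\pi_T^{(n)}(x)\,dx\,\exp\Big(-\int_0^Tc_s(y_s^{(n)})\,ds\Big).
\end{aligned}
\tag{3.14}
$$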

Our next step is to pass to the limit in (3.14) as $n\to\infty$. It is a standard
fact that for any $m>0$

$$
\lim_{n\to\infty}E\sup_{t\le T}|z_t^{(n)}-z_t|^m=0, \tag{3.15}
$$

which, in particular, implies that the left-hand sides of (3.14) tend to

$$
Ef(x_T)\exp\Big(-\int_0^Tc_s(y_s)\,ds\Big).
$$

Furthermore, the process $\rho_t^{(n)}$ is the solution of the linear equation

$$
d\rho_t^{(n)}=-\rho_t^{(n)}\gamma_t^{(n)}\,dw_t
$$

with initial condition $\rho_0^{(n)}=1$, where

$$
\gamma_t^{(n)}=\Psi(t,y_t^{(n)})B(t,z_t^{(n)})\Psi(t,y_t^{(n)})\Theta(t,y_t^{(n)}).
$$

Also introduce

$$
\gamma_t=\Psi(t,y_t)B(t,z_t)\Psi(t,y_t)\Theta(t,y_t)
$$

and observe that the processes $\gamma_t^{(n)}$ and $\gamma_t$ are bounded.
Furthermore, it follows from (3.15) that for any $m>0$

$$
\lim_{n\to\infty}E\sup_{t\le T}|\gamma_t^{(n)}-\gamma_t|^m=0,
$$

which in turn implies that

$$
\lim_{n\to\infty}E\sup_{t\le T}|\rho_t^{(n)}-\rho_t|^m=0,
$$

where $\rho_t$ is the solution of the equation $d\rho_t=-\rho_t\gamma_t\,dw_t$ with initial condition
$\rho_0=1$ and is given in (3.4).

To investigate the limit of the remaining factor on the right in (3.14) we
will use Theorem 5.2. By the well-known properties of convolutions

$$
\|\pi_0^{(n)}\|^p_{H_p^{1-2/p}}\le\|\pi_0\|^p_{H_p^{1-2/p}},\quad
\lim_{n\to\infty}E\|\pi_0^{(n)}-\pi_0\|^p_{H_p^{1-2/p}}=0.
$$

This and Remark 3.1 show that the assumption of Theorem 5.2 regarding
the convergence of the initial data for $\bar\pi_t^{(n)}$ and $\bar\pi_t$ is satisfied. Furthermore,
there are no free terms in filtering equations. Therefore, it only remains to
check the appropriate convergence of the coefficients. Theorem 5.2 requires
the following convergences in measure $P(d\omega)\,dt\,dx$ to hold on $\Omega\times[0,T]\times\mathbb{R}^d$:

$$
a_t^{(n)}(x)\to a_t(x),\quad b_t^{(n)}(x)\to b_t(x),\quad D_ia_t^{(n)ij}(x)\to D_ia_t^{ij}(x),
$$

$$
\sigma_t^{(n)}(x)\to\sigma_t(x),\quad\beta_t^{(n)}(x)\to\beta_t(x),\quad D_i\sigma_t^{(n)ik}(x)\to D_i\sigma_t^{ik}(x).
$$

Relation (3.15) and the assumption that the coefficients of system (1.1)
are Lipschitz continuous show that, actually, apart from cases involving the
derivatives of $a$ and $\sigma$ all the remaining convergences hold uniformly in $(t,x)$
almost surely. It is easy to see that in order to take care of the terms with
derivatives it suffices to check that

$$
D_i\theta^{(n)}(t,x,y_t^{(n)})\to D_i\theta(t,x,y_t) \tag{3.16}
$$

in measure for any $i=1,\dots,d$. Observe that by the well known properties
of convolutions

$$
D_i\theta^{(n)}(t,x,y)\to D_i\theta(t,x,y)
$$

for almost all $(t,x,y)$. Therefore, applying Lemma 3.2 shows that (3.16)
holds.

Now by Theorem 1 5 . 2 1 and Holder's inequality we conclude 

lim E\ f f(x)ir!p\x)dx - [ /(x)vf T (x) dx\ p = 0. (3.17) 

n— oo J Rd ■ J Rd 



This along with the above investigation of other terms in (|3.14j) yields after 
letting n — > oo that 

Ef(x T )exp(- c s (y s )ds) = Ep T (TtT,f)exp(- c s (y s )ds). 
Jo Jo 

The arbitrariness of $c$ leads to

$$E(f(x_T)\mid\mathcal F^y_T)=E(\rho_T(\bar\pi_T,f)\mid\mathcal F^y_T)\quad\text{(a.s.)},$$

which combined with the $\mathcal F^y_T$-measurability of $\bar\pi_T$ (Lemma 3.3) shows that

$$E(f(x_T)\mid\mathcal F^y_T)=(\bar\pi_T,f)E(\rho_T\mid\mathcal F^y_T)\quad\text{(a.s.)}.\tag{3.18}$$

Observe that on the set of $\omega$ where

$$E(\rho_T\mid\mathcal F^y_T)=0\tag{3.19}$$

we have (a.s.)

$$E(f(x_T)\mid\mathcal F^y_T)=0.$$

The arbitrariness of $f$ shows that on the said set (a.s.)

$$1=E(1\mid\mathcal F^y_T)=0$$

and consequently (3.19) can only happen with probability zero.

Furthermore, by Theorem 5.4 we have $\bar\pi_t\ge0$. A standard measure-theoretic argument then shows that (3.18) holds for all nonnegative Borel $f$ rather than only for $f\in C_0^\infty(\mathbb R^d)$. By taking $f=1$ we see that

$$1=(\bar\pi_T,1)E(\rho_T\mid\mathcal F^y_T)\quad\text{(a.s.)}$$

implying that

$$\infty>(\bar\pi_T,1)>0,\qquad E(\rho_T\mid\mathcal F^y_T)=(\bar\pi_T,1)^{-1}\quad\text{(a.s.)}.\tag{3.20}$$
Coming back to (3.18) we conclude

$$E[f(x_T)\mid\mathcal F^y_T]=\frac{(\bar\pi_T,f)}{(\bar\pi_T,1)}\quad\text{(a.s.)}$$

for any nonnegative and any bounded Borel $f$ as well. Obviously, one can replace here $T$ with any $t\in[0,T]$ and to prove Theorem 2.6 it only remains to show that (a.s.) relation (2.12) holds for all $t\in[0,T]$.
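
As a sanity check on the normalization step, the passage from (3.18) to the quotient formula can be written out in one chain. This is only a sketch: $\bar\pi_T$ denotes the unnormalized density, $\rho_T$ and $\mathcal F^y_T$ are as in the surrounding text, and $\pi_T$ below is merely our shorthand for the normalized function.

```latex
% Sketch: combine (3.18) for a general f with its special case f = 1, i.e. (3.20).
\begin{align*}
E\bigl(f(x_T)\mid\mathcal F^y_T\bigr)
  &= (\bar\pi_T,f)\,E\bigl(\rho_T\mid\mathcal F^y_T\bigr)
     && \text{by (3.18)}\\
  &= (\bar\pi_T,f)\,(\bar\pi_T,1)^{-1}
     && \text{by (3.20)}\\
  &= \int_{\mathbb R^d} f(x)\,\pi_T(x)\,dx,
     \qquad \pi_T(x):=\frac{\bar\pi_T(x)}{(\bar\pi_T,1)} .
\end{align*}
```

In particular, taking $f=1$ gives $(\pi_T,1)=1$, as it should be for a conditional density.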

The second inequality in (2.12) holds due to Lemma 3.4. To prove the first one it only remains to observe that by the above for each particular $t\in[0,T]$ with probability one

$$\int_{\mathbb R^d}\bar\pi_t(x)\,dx>0$$

and by Theorem 5.5 the above integral is continuous in $t$ with probability one. The theorem is proved.



4. Proof of Theorem 2.8

To prove (i) we first show that the right-hand sides of (3.10) converge as $n\to\infty$ uniformly in $t\in[0,T]$ in probability to the right-hand side of (2.15). Owing to (…) and (…)

$$\int_0^T|(\bar\pi_s,L_s\psi_n)|\,ds\le NTn^{-1}\sup_{s\in[0,T]}(\bar\pi_s,1)\to0\quad\text{(a.s.)},$$

where $N$ is the constant from (3.8). Similarly one takes care of the term with $ds$ containing the derivatives of $\psi_n$ in the second integral on the right in (3.10). Observing that by the dominated convergence theorem and again by (…)

$$\dots$$

we conclude that the usual integrals on the right-hand sides of (3.10) converge as $n\to\infty$ uniformly in $t\in[0,T]$ to the usual integral on the right-hand side of (2.15) almost surely.

To show the convergence of the stochastic integrals in (3.10) to the stochastic integral in (2.15) uniform in probability it suffices (and is necessary) to show that the quadratic variation of the differences converges to zero in probability. The said quadratic variation is obviously less than a constant times

$$\dots$$

which tends to zero (a.s.) by the same reasons as above. Thus, indeed the right-hand sides of (3.10) converge as $n\to\infty$ uniformly in $t\in[0,T]$ in probability to the right-hand side of (2.15). The left-hand sides converge for all $t\in[0,T]$ (a.s.) by the monotone convergence theorem. This proves (i).

Assertion (ii) easily follows from the continuity of $(\bar\pi_t,1)$, the continuity of $\bar\pi_t$ as an $L_p$-valued process, and Scheffé's lemma.

In (iii) that $P_t[\beta]$ is bounded follows from the boundedness of $\beta$. The stated measurability properties of $P_t[\beta]$ are obtained by a standard measure-theoretic argument from the fact that if $f(t,x,y)=\alpha(t)\beta(x)\gamma(y)$, where $\alpha$, $\beta$, $\gamma$ are smooth functions with compact support, then

$$\dots$$

possesses the measurability properties in (iii) since the last factor is a continuous (a.s.) $\mathcal F^y_t$-adapted process.

To prove (2.16) it suffices to use (2.13) which implies that for each $t\in[0,T]$ and $y\in\mathbb R^{d_1-d}$

$$E(B(t,x_t,y)\mid\mathcal F^y_t)=(\bar\pi_t,1)^{-1}\int_{\mathbb R^d}B(t,x,y)\bar\pi_t(x)\,dx\quad\text{(a.s.)}$$

and then plug in here $y_t$ in place of $y$ in the argument of $B$, which is possible because $B(t,x,y)$ is Lipschitz in $y$ (even in $(x,y)$). This finishes proving assertion (iii).

In (iv) the fact that $\hat w_t$ is $\mathcal F^y_t$-measurable easily follows from an equivalent formula for $\hat w_t$:

$$\hat w_t=\int_0^t\Psi(s,y_s)\,dy_s-\int_0^tP_s[\beta]\,ds,$$

where all terms on the right are $\mathcal F^y_t$-measurable. Furthermore, $\hat w_t$ turns out to be an $\mathcal F^y_t$-martingale on $[0,T]$. To check this, take any $\mathcal F^y_t$-stopping time $\tau\le T$ and notice that $\tau$ is also an $\mathcal F_t$-stopping time, so that

$$E\hat w_\tau=E\int_0^\tau(\beta_t(x_t)-P_t[\beta])\,dt.$$

By using (2.16) and the fact that, by definition, $\{t<\tau\}\in\mathcal F^y_t$ we see that the right-hand side equals

$$\begin{aligned}
E\int_0^TI_{t<\tau}(\beta_t(x_t)-P_t[\beta])\,dt&=\int_0^TEI_{t<\tau}\beta_t(x_t)\,dt-\int_0^TEI_{t<\tau}P_t[\beta]\,dt\\
&=\int_0^TEI_{t<\tau}\beta_t(x_t)\,dt-\int_0^TEI_{t<\tau}E(\beta_t(x_t)\mid\mathcal F^y_t)\,dt=0.
\end{aligned}$$

Thus, $E\hat w_\tau=0$ for any $\mathcal F^y_t$-stopping time $\tau\le T$, which combined with the $\mathcal F^y_t$-adaptedness of $\hat w_t$ and its continuity in $t$ is well known to be equivalent to saying that $\hat w_t$ is an $\mathcal F^y_t$-martingale on $[0,T]$. Its quadratic variation can be evaluated as the limit of sums of products of increments and is, obviously, equal to the quadratic variation of $\tilde w_t$, which, as we have seen in the proof of Lemma 3.3, is a Wiener process. Therefore, the quadratic variation of $\hat w_t$ is that of a Wiener process and by Lévy's theorem $\hat w_t$ is itself a Wiener process with respect to $\mathcal F^y_t$. This proves assertion (iv).
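
The cancellation that gives $E\hat w_\tau=0$ in the proof of (iv) is the tower property of conditional expectations. A short sketch, assuming that (2.16) reads $P_t[\beta]=E(\beta_t(x_t)\mid\mathcal F^y_t)$, as the computation above suggests, and writing $\tau$ for the stopping time:

```latex
% Tower property: the indicator of {t < tau} is F^y_t-measurable,
% so it can be moved inside the conditional expectation.
\begin{align*}
E\,I_{t<\tau}\,P_t[\beta]
  &= E\bigl(I_{t<\tau}\,E(\beta_t(x_t)\mid\mathcal F^y_t)\bigr)\\
  &= E\bigl(E(I_{t<\tau}\,\beta_t(x_t)\mid\mathcal F^y_t)\bigr)
     && \text{since } \{t<\tau\}\in\mathcal F^y_t\\
  &= E\,I_{t<\tau}\,\beta_t(x_t).
\end{align*}
```

Integrating this identity over $t\in[0,T]$ shows that the two integrals in the last display of the computation coincide.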

In (v) inequality (2.18) follows from (2.17), the fact that $\beta$ is bounded, and the well-known properties of exponential martingales. To prove (2.17) observe that (2.15) in terms of $P_t[\beta]$ and $\hat w_t$ is rewritten as

$$\begin{aligned}
d(\bar\pi_t,1)&=(\bar\pi_t,\beta^k_t)\beta^k_t(x_t)\,dt+(\bar\pi_t,\beta^k_t)\,dw^k_t\\
&=(\bar\pi_t,1)P_t[\beta^k]\beta^k_t(x_t)\,dt+(\bar\pi_t,1)P_t[\beta^k]\,dw^k_t\\
&=(\bar\pi_t,1)|P_t[\beta]|^2\,dt+(\bar\pi_t,1)P_t[\beta^k]\,d\hat w^k_t.
\end{aligned}$$

Hence, $(\bar\pi_t,1)$ satisfies the linear equation

$$d(\bar\pi_t,1)=(\bar\pi_t,1)|P_t[\beta]|^2\,dt+(\bar\pi_t,1)P_t[\beta^k]\,d\hat w^k_t,$$

the unique solution of which with initial data $(\bar\pi_0,1)=(\pi_0,1)=1$ is known to be given by (2.17). The theorem is proved.
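
For the reader's convenience, here is a sketch of why a linear equation of this type has an explicit exponential solution. The symbols $Z_t:=(\bar\pi_t,1)$, $X_t$, and $\hat w_t$ (for the driving Wiener process) are our shorthand, and we do not reproduce (2.17) itself; the formula below is simply what Itô's formula yields under the assumption that $P_t[\beta]$ is bounded.

```latex
% Candidate solution of
%   dZ_t = Z_t |P_t[\beta]|^2 dt + Z_t P_t[\beta^k] d\hat w^k_t,   Z_0 = 1.
\[
Z_t=\exp\Bigl(\int_0^t P_s[\beta^k]\,d\hat w^k_s
      +\frac12\int_0^t |P_s[\beta]|^2\,ds\Bigr)=:e^{X_t}.
\]
% Check by Ito's formula, using d<X>_t = |P_t[\beta]|^2 dt:
\begin{align*}
dX_t &= P_t[\beta^k]\,d\hat w^k_t+\tfrac12|P_t[\beta]|^2\,dt,\\
dZ_t &= Z_t\,dX_t+\tfrac12 Z_t\,d\langle X\rangle_t
      = Z_t\,P_t[\beta^k]\,d\hat w^k_t
        +Z_t\bigl(\tfrac12+\tfrac12\bigr)|P_t[\beta]|^2\,dt .
\end{align*}
```
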
5. Appendix

The setting in this section is somewhat different from that of Section 1. Let $(\Omega,\mathcal F,P)$ be a complete probability space with an increasing filtration $\{\mathcal F_t,t\ge0\}$ of complete, with respect to $(\mathcal F,P)$, $\sigma$-fields $\mathcal F_t\subset\mathcal F$. Denote by $\mathcal P$ the predictable $\sigma$-field in $\Omega\times(0,\infty)$ associated with $\{\mathcal F_t\}$. Let $w^k_t$, $k=1,2,\dots$, be independent one-dimensional Wiener processes with respect to $\{\mathcal F_t\}$.

We take a stopping time $\tau$ and for $t\le\tau$ we are considering the following equation in $\mathbb R^d$:

$$du_t=(L_tu_t-\lambda u_t+D_if^i_t+f^0_t)\,dt+(\Lambda^k_tu_t+g^k_t)\,dw^k_t,\tag{5.1}$$

where $u_t=u_t(x)=u_t(\omega,x)$ is an unknown function,

$$L_t\psi(x)=D_j\big(a^{ij}_t(x)D_i\psi(x)+\mathfrak a^j_t(x)\psi(x)\big)+b^i_t(x)D_i\psi(x)+c_t(x)\psi(x),$$

$$\Lambda^k_t\psi(x)=\sigma^{ik}_t(x)D_i\psi(x)+\nu^k_t(x)\psi(x),$$

the summation convention with respect to $i,j=1,\dots,d$ and $k=1,2,\dots$ is enforced and detailed assumptions on the coefficients and the free terms will be given later.
Fix a number

$$p\ge2$$

and denote $L_p=L_p(\mathbb R^d)$. We use the same notation $L_p$ for vector- and matrix-valued or else $\ell_2$-valued functions such as $g_t=(g^k_t)$ in (5.1). For instance, if $u(x)=(u^1(x),u^2(x),\dots)$ is an $\ell_2$-valued measurable function on $\mathbb R^d$, then

$$\|u\|^p_{L_p}=\int_{\mathbb R^d}|u(x)|^p_{\ell_2}\,dx=\int_{\mathbb R^d}\Big(\sum_{k=1}^\infty|u^k(x)|^2\Big)^{p/2}dx.$$



As above,

$$D_i=\frac{\partial}{\partial x^i},\quad i=1,\dots,d,\qquad\Delta=D_1^2+\dots+D_d^2.$$

By $Du$ and $D^2u$ we mean the gradient and the matrix of second-order derivatives with respect to $x$ of a function $u$ on $\mathbb R^d$.

As above, for $\gamma\in\mathbb R$ by $H^\gamma_p=(1-\Delta)^{-\gamma/2}L_p$ we denote the space of Bessel potentials. Observe a slight change of notation. Since we will always be dealing with $\mathbb R^d$ we drop this symbol in the notation like $H^\gamma_p(\mathbb R^d)$. Most often in this appendix we will use $H^\gamma_p$ for $\gamma=0,1$ and use (2.5).
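
To make the definition concrete, the norm in $H^\gamma_p$ can be written through the Fourier transform. What follows is the standard description of Bessel potential spaces for $1<p<\infty$, given as a reminder; the symbols $\mathcal F$ and $\xi$ are our notation and do not appear in the text above.

```latex
% Convention assumed here: (\mathcal F u)(\xi) = \int e^{-i x\cdot\xi} u(x)\,dx
% (up to a constant), so that 1-\Delta acts as multiplication by 1+|\xi|^2.
\[
\|u\|_{H^\gamma_p}:=\|(1-\Delta)^{\gamma/2}u\|_{L_p},\qquad
(1-\Delta)^{\gamma/2}u
  =\mathcal F^{-1}\bigl[(1+|\xi|^2)^{\gamma/2}\,\mathcal F u\bigr].
\]
% The two cases gamma = 0 and gamma = 1:
\[
H^0_p=L_p,\qquad
N^{-1}\bigl(\|u\|_{L_p}+\|Du\|_{L_p}\bigr)
  \le\|u\|_{H^1_p}
  \le N\bigl(\|u\|_{L_p}+\|Du\|_{L_p}\bigr),
\]
% where N depends only on d and p.
```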

If $\tau$ is a stopping time, then

$$\mathbb H^\gamma_p(\tau):=L_p((0,\tau]\!],\mathcal P,H^\gamma_p),\qquad\mathbb L_p(\tau)=\mathbb H^0_p(\tau).$$

We also need the space $\mathcal H^1_p(\tau)$, which is the space of functions $u_t=u_t(\omega,\cdot)$ on $\{(\omega,t):0\le t\le\tau,\ t<\infty\}$ with values in the space of generalized functions on $\mathbb R^d$ having the following properties:

(i) For any $T\in[0,\infty)$, we have $u\in\mathbb H^1_p(\tau\wedge T)$ and $u_0\in L_p(\Omega,\mathcal F_0,L_p)$;

(ii) There exist $f^i\in\mathbb L_p(\tau)$, $i=0,\dots,d$, and $g=(g^1,g^2,\dots)\in\mathbb L_p(\tau)$ such that for any $\varphi\in C_0^\infty$ with probability 1 for all finite $t\le\tau$ we have

$$(u_t,\varphi)=(u_0,\varphi)+\int_0^t\big(-(f^i_s,D_i\varphi)+(f^0_s,\varphi)\big)\,ds+\sum_{k=1}^\infty\int_0^t(g^k_s,\varphi)\,dw^k_s.\tag{5.2}$$

The reader can find in [K99] a discussion of (i) and (ii), in particular, the fact that the series in (5.2) converges uniformly in probability on every finite subinterval of $[0,\tau)$. On the other hand, it is worth saying that the above introduced spaces $\mathcal H^1_p(\tau)$ are not quite the same as in [K99]. There are three differences. One is that there is a restriction on $u_0$ in [K99]. However, the most important spaces are $\mathcal H^1_{p,0}(\tau)$, which are defined as the subsets of $\mathcal H^1_p(\tau)$ consisting of functions with $u_0=0$. All other elements of $\mathcal H^1_p(\tau)$ are obtained by adding to an element of $\mathcal H^1_{p,0}(\tau)$ an appropriate continuation for $t>0$ of the initial data. Another issue is that in [K99] we have $f^i=0$, $i=1,\dots,d$, and $f^0\in\mathbb H^{-1}_p(\tau)$. Actually, this difference is fictitious because one knows that any $f\in H^{-1}_p$

(a) has the form $D_if^i+f^0$ with $f^j\in L_p$ and

$$\|f\|_{H^{-1}_p}\le N\sum_{j=0}^d\|f^j\|_{L_p},$$

where $N$ is independent of $f$, $f^j$, and on the other hand,

(b) for any $f\in H^{-1}_p$ there exist $f^j\in L_p$ such that $f=D_if^i+f^0$ and

$$\sum_{j=0}^d\|f^j\|_{L_p}\le N\|f\|_{H^{-1}_p},$$

where $N$ is independent of $f$.
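
One explicit choice of the functions in (b) is obtained by writing $f=(1-\Delta)(1-\Delta)^{-1}f$. It is given here only as an illustration under the standard properties of Bessel potentials for $1<p<\infty$, not as the construction used in [K99].

```latex
% A possible decomposition f = D_i f^i + f^0 (summation over i = 1,...,d).
\[
f^0:=(1-\Delta)^{-1}f,\qquad f^i:=-D_i(1-\Delta)^{-1}f,\quad i=1,\dots,d,
\]
\[
D_if^i+f^0=-\Delta(1-\Delta)^{-1}f+(1-\Delta)^{-1}f
          =(1-\Delta)(1-\Delta)^{-1}f=f,
\]
% The L_p norms are controlled because (1-\Delta)^{-1} maps H^{-1}_p onto H^1_p isometrically:
\[
\|f^j\|_{L_p}\le N\|(1-\Delta)^{-1}f\|_{H^1_p}=N\|f\|_{H^{-1}_p},
\qquad j=0,\dots,d .
\]
```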

The third difference is that instead of (i) we require $D^2u\in\mathbb H^{-1}_p(\tau)$ in [K99]. However, as it follows from Theorem 3.7 of [K99] and the boundedness of the operator $D:L_p\to H^{-1}_p$, this difference disappears if $\tau$ is a bounded stopping time.

To summarize, the spaces $\mathcal H^1_p(\tau)$ introduced above and in [K99] coincide if $\tau$ is bounded and we choose a particular representation of the deterministic part of the stochastic differential just for convenience.

In case that property (ii) holds, we write

$$du_t=(D_if^i_t+f^0_t)\,dt+g^k_t\,dw^k_t\tag{5.3}$$

for $t\le\tau$ and this explains the sense in which equation (5.1) is understood. Of course, we still need to specify appropriate assumptions on the coefficients and the free terms in (5.1). Before we go to these assumptions we remind the reader that according to [K99] and the above discussion, for bounded $\tau$, one introduces a norm in $\mathcal H^1_{p,0}(\tau)$ by

$$\|u\|^p_{\mathcal H^1_{p,0}(\tau)}=E\int_0^\tau\Big(\sum_{j=1}^d\|D_ju_s\|^p_{L_p}+\sum_{j=0}^d\|f^j_s\|^p_{L_p}+\|g_s\|^p_{L_p}\Big)\,ds$$

if $u$ satisfies (5.3). By identifying two elements of $\mathcal H^1_{p,0}(\tau)$ if their difference has a zero $\mathcal H^1_{p,0}(\tau)$-norm, one obtains a Banach space (see [K99]).

We will also identify two elements $u',u''\in\mathcal H^1_p(\tau)$ if and only if the difference $u'-u''$ is in $\mathcal H^1_{p,0}(\tau)$ and equals zero.

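Assumption 5.1. (i) The coefficients $a^{ij}_t$, $\mathfrak a^i_t$, $b^i_t$, $\sigma^{ik}_t$, $c_t$, and $\nu^k_t$ are measurable with respect to $\mathcal P\times\mathcal B(\mathbb R^d)$, where $\mathcal B(\mathbb R^d)$ is the Borel $\sigma$-field on $\mathbb R^d$.

(ii) There is a constant $K$ such that for all values of indices and arguments

$$|\mathfrak a^i_t|+|b^i_t|+|c_t|+|\nu_t|_{\ell_2}\le K,\qquad c_t\le0.$$

(iii) There is a constant $\delta>0$ such that for all values of the arguments and $\xi\in\mathbb R^d$

$$(a^{ij}_t-\alpha^{ij}_t)\xi^i\xi^j\ge\delta|\xi|^2,\qquad|a^{ij}_t|\le\delta^{-1},\tag{5.4}$$

where $\alpha^{ij}_t=(1/2)(\sigma^{i\cdot}_t,\sigma^{j\cdot}_t)_{\ell_2}$. Finally, the constant $\lambda\ge0$.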

Assumption 5.1 (i) guarantees that equation (5.1) makes perfect sense for any constant $\lambda$ if $u\in\mathcal H^1_p(\tau)$. By the way, adding the term $-\lambda u_t$ with constant $\lambda\ge0$ is one more technically convenient step. One can always introduce this term, if originally it is absent, by considering $v_t:=u_te^{-\lambda t}$.
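
The change of the unknown function can be verified directly. A sketch, assuming that $u$ solves the equation without the $\lambda$ term and with free terms $f^i_t$, $f^0_t$, $g^k_t$ as in (5.1); the product rule applies without correction terms because $e^{-\lambda t}$ is deterministic and of bounded variation, and $L_t$, $\Lambda^k_t$ are linear.

```latex
\begin{align*}
du_t &= (L_tu_t+D_if^i_t+f^0_t)\,dt+(\Lambda^k_tu_t+g^k_t)\,dw^k_t,
\qquad v_t:=u_te^{-\lambda t},\\
dv_t &= e^{-\lambda t}\,du_t-\lambda e^{-\lambda t}u_t\,dt\\
     &= \bigl(L_tv_t-\lambda v_t+D_i(e^{-\lambda t}f^i_t)
        +e^{-\lambda t}f^0_t\bigr)\,dt
        +\bigl(\Lambda^k_tv_t+e^{-\lambda t}g^k_t\bigr)\,dw^k_t .
\end{align*}
```

Thus $v$ satisfies an equation of type (5.1) with the same coefficients and with the free terms multiplied by $e^{-\lambda t}$, which does not increase their norms.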

Assumption 5.2. There is a continuous function $\kappa(\varepsilon)$ defined for $\varepsilon\ge0$ such that $\kappa(0)=0$ and

$$|\sigma^{i\cdot}_t(x)-\sigma^{i\cdot}_t(y)|_{\ell_2}+|a^{ij}_t(x)-a^{ij}_t(y)|\le\kappa(|x-y|)$$

for all $i,j,t,x,y$.

Here are the main results used in the previous sections concerning (5.1). They are taken from [Ki04] and [K09]. Generalization of these results to the case of VMO coefficients $a^{ij}_t$ can be found in [K09].

Theorem 5.1. Let $\lambda\ge0$, let $\tau$ be a stopping time, let $f^j,g\in\mathbb L_p(\tau)$, and let $\psi$ be a function such that $\psi\in\mathbb H^1_p(\tau)\cap\mathcal H^1_p(\tau)$. Then equation (5.1) on $[0,\tau)$ has a unique solution $u\in\mathcal H^1_p(\tau)$ such that $u_0=\psi_0$.

Write

$$d\psi_t=(D_i\alpha^i_t+\alpha^0_t)\,dt+\beta^k_t\,dw^k_t.$$

Then the above solution $u$ satisfies

$$\begin{aligned}
\lambda^{1/2}\|u\|_{\mathbb L_p(\tau)}+\|Du\|_{\mathbb L_p(\tau)}
&\le N\Big(\sum_{i=1}^d\|f^i\|_{\mathbb L_p(\tau)}+\|g\|_{\mathbb L_p(\tau)}+\sum_{i=1}^d\|\alpha^i\|_{\mathbb L_p(\tau)}+\|\beta\|_{\mathbb L_p(\tau)}+\|\psi\|_{\mathbb H^1_p(\tau)}\Big)\\
&\quad+N\lambda^{-1/2}\big(\|f^0\|_{\mathbb L_p(\tau)}+\|\alpha^0\|_{\mathbb L_p(\tau)}+\|\psi\|_{\mathbb H^1_p(\tau)}\big)+N\lambda^{1/2}\|\psi\|_{\mathbb L_p(\tau)},
\end{aligned}\tag{5.5}$$

provided that $\lambda\ge\lambda_0$, where the constants $N,\lambda_0\ge0$ depend only on $d,p,K,\delta$, and the function $\kappa$.

Observe that estimate (5.5) shows a good reason for writing the free term in (5.1) in the form $D_if^i+f^0$, because $f^i$, $i=1,\dots,d$, and $f^0$ enter (5.5) differently.

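Here is a result about continuous dependence of solutions on the data.

Theorem 5.2. Assume that for each $n=1,2,\dots$ we are given functions $a^{nij}_t$, $\mathfrak a^{ni}_t$, $b^{ni}_t$, $c^n_t$, $\sigma^{nik}_t$, $\nu^{nk}_t$, $f^{nj}_t$, $g^{nk}_t$, and $\psi^n$ having the same meaning and satisfying the same assumptions with the same $\delta$, $K$, $\kappa$ as the original ones. Assume that the coefficients with superscript $n$ converge to the corresponding original coefficients as $n\to\infty$ in measure $P(d\omega)\,dt\,dx$. Also let

$$d\psi^n_t=(D_i\alpha^{ni}_t+\alpha^{n0}_t)\,dt+\beta^{nk}_t\,dw^k_t$$

and assume that for a stopping time $\tau$

$$\sum_{j=0}^d\big(\|f^{nj}-f^j\|_{\mathbb L_p(\tau)}+\|\alpha^{nj}-\alpha^j\|_{\mathbb L_p(\tau)}\big)+\|g^n-g\|_{\mathbb L_p(\tau)}+\|\beta^n-\beta\|_{\mathbb L_p(\tau)}+\|\psi^n-\psi\|_{\mathbb H^1_p(\tau)}\to0$$

as $n\to\infty$. Take $\lambda\ge\lambda_0$, take the function $u$ from Theorem 5.1 and let $u^n$ be the unique solutions of equations (5.1) constructed from $a^{nij}_t$, $\mathfrak a^{ni}_t$, $b^{ni}_t$, $c^n_t$, $\sigma^{nik}_t$, $\nu^{nk}_t$, $f^{nj}_t$, and $g^{nk}_t$ and having initial values $\psi^n_0$.

Then for any finite $T>0$ we have

$$\|u^n-u\|_{\mathbb H^1_p(\tau\wedge T)}\to0,\qquad E\sup_{t\le\tau\wedge T}\|u^n_t-u_t\|^p_{L_p}\to0$$

as $n\to\infty$.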

The following result shows that the solution does not depend on $p$.

Theorem 5.3. Let $p_1,p_2\in[2,\infty)$ and let the assumptions of Theorem 5.1 be satisfied with $p=p_1$ and $p=p_2$. Then the solutions corresponding to $p=p_1$ and $p=p_2$ coincide, that is, there is a unique solution $u\in\mathcal H^1_{p_1}(\tau)\cap\mathcal H^1_{p_2}(\tau)$ of equation (5.1) with initial data $\psi_0$.

In many situations the following maximum principle is useful.

Theorem 5.4. Under the assumptions of Theorem 5.1 suppose that $\psi_0\ge0$, $f^i=0$, $i=1,\dots,d$, $f^0\ge0$, $g=0$. Then for the solution $u$ almost surely we have $u_t\ge0$ for all finite $t\le\tau$.




Finally, we used the following embedding theorem (see Corollary 4.12 and Remark 4.14 of [K01]). For $\kappa\in(0,1)$, a Banach space $X$, and a set $A\subset\mathbb R^d$ by $C^\kappa(A,X)$ we mean Hölder's space of continuous $X$-valued functions on $A$ with finite norm $\|\cdot\|_{C^\kappa(A,X)}$ defined by

$$[u]_{C^\kappa(A,X)}=\sup_{s,t\in A}|t-s|^{-\kappa}|u(t)-u(s)|_X,\qquad\|u\|_{C(A,X)}=\sup_{t\in A}|u(t)|_X,$$

$$\|u\|_{C^\kappa(A,X)}=[u]_{C^\kappa(A,X)}+\|u\|_{C(A,X)}.$$

Theorem 5.5. Let $\tau\le T$, where the constant $T\in(0,\infty)$, and let $u\in\mathcal H^1_p(\tau)$ satisfy (5.3) with $f^j\in\mathbb L_p(\tau)$, $g\in\mathbb L_p(\tau)$, and $u_0\in L_p(\Omega,\mathcal F_0,H^{1-2/p}_p)$. Then:

(i) Almost surely $u_t$ is a continuous function of $t$ with values in $L_p$ for all $t\in[0,\tau]$.

(ii) (case $p>2$) Assume that for some numbers $\alpha$ and $\beta$ we have

$$2/p<\alpha<\beta\le1.$$

Then, for any $a>0$,

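$$E[u]^p_{C^{\alpha/2-1/p}([0,\tau],H^{1-\beta}_p)}\le NT^{(\beta-\alpha)p/2}a^{\beta-1}I(a),\tag{5.6}$$

$$E\sup_{t\le\tau}\|u_t\|^p_{H^{1-\beta}_p}\le NE\|u_0\|^p_{H^{1-\beta}_p}+NT^{\beta p/2-1}a^{\beta-1}I(a),\tag{5.7}$$

where the constants $N$ are independent of $a$, $\tau$, $T$, and $u$ and

$$I(a):=a\|u\|^p_{\mathbb H^1_p(\tau)}+a^{-1}\sum_{j=0}^d\|f^j\|^p_{\mathbb L_p(\tau)}+\|g\|^p_{\mathbb L_p(\tau)}.$$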

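In particular, if $p(1-\beta)>d$, then

$$E\sup_{x\in\mathbb R^d}[u(\cdot,x)]^p_{C^{\alpha/2-1/p}([0,\tau])}\le NT^{(\beta-\alpha)p/2}a^{\beta-1}I(a),\tag{5.8}$$

$$E\sup_{t\in[0,\tau]}\|u(t,\cdot)\|^p_{C^{1-\beta-d/p}}\le NE\|u(0)\|^p_{H^{1-\beta}_p}+NT^{\beta p/2-1}a^{\beta-1}I(a).\tag{5.9}$$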

Finally, (5.7) also holds if $p=2$ and $\beta=1$.

It is probably worth saying that (5.8) and (5.9) are not stated in [K01]. These are just obvious consequences of (5.6) and (5.7) and the embedding theorem: $H^\gamma_p\subset C^{\gamma-d/p}$ if $\gamma-d/p>0$ and $\gamma-d/p$ is not an integer.
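
The role of the condition $p(1-\beta)>d$ is easy to trace. A sketch of the bookkeeping, with $\gamma:=1-\beta$ and a generic function $v$ of $x$ (both are our shorthand):

```latex
% The exponent of the Holder space in x is positive exactly when p(1-beta) > d.
\[
\gamma-\frac dp=1-\beta-\frac dp>0
\iff p(1-\beta)>d .
\]
% Since beta > 2/p > 0, we also have gamma - d/p < 1, so this exponent lies in (0,1)
% and is automatically not an integer. The embedding then gives a pointwise bound:
\[
\sup_{x\in\mathbb R^d}|v(x)|\le\|v\|_{C^{\gamma-d/p}}\le N\|v\|_{H^{\gamma}_p}
=N\|v\|_{H^{1-\beta}_p}.
\]
```

Applying the last inequality to $v=u_t-u_s$ and to $v=u_t$ turns estimates in $H^{1-\beta}_p$ with respect to $x$ into estimates that are pointwise or Hölder in $x$.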